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ABSTRACT 

Using an eigenspectrum decomposition technique, we separate the host galaxy from the broad line 
active galactic nucleus (AGN) in a set of 4666 spectra from the Sloan Digital Sky Survey (SDSS), from 
rcdshifts near zero up to about 0.75. The decomposition technique uses separate sets of galaxy and 
quasar eigenspectra to efficiently and reliably separate the AGN and host spectroscopic components. 
The technique accurately reproduces the host galaxy spectrum, its contributing fraction, and its 
classification. We show how the accuracy of the decomposition depends upon S/N, host galaxy 
fraction, and the galaxy class. Based on the eigencoefficients, the sample of SDSS broad-line AGN 
host galaxies spans a wide range of spectral types, but the distribution differs significantly from 
inactive galaxies. In particular, post-starburst activity appears to be much more common among 
AGN host galaxies. The luminosities of the hosts are much higher than expected for normal early- 
type galaxies, and their colors become increasingly bluer than early-type galaxies with increasing host 
luminosity. Most of the AGNs with detected hosts are emitting at between 1% and 10% of their 
estimated Eddington luminosities, but the sensitivity of the technique usually does not extend to 
the Eddington limit. There are mild correlations among the AGN and host galaxy eigencoefficients, 
possibly indicating a link between recent star formation and the onset of AGN activity. The catalog 
of spectral reconstruction parameters is available as an electronic table. 
Subject headings: quasars: general — galaxies: active — surveys — techniques:spectroscopic 



1. INTRODUCTION 

Quasars, and active galactic nuclei (AGNs) in general, 
are known to exist at the centers of galaxies, and it is 
believed that accretion of galactic material onto a su- 
permassive black hole is the primary mechanism of thei r 
energy generation (e.g. ILvndeu-Belll IT969t iReTsl |l984) . 
Numerous studies now suggest a close link between the 
growth of supermassive black holes and normal galax- 
ies, in particular a correlation betw een black hole m ass 
and galactic stellar bulge mass (e.g. !Ferraresell2002t for 
a review). An understanding of the physical connection 
between the AGN central engine and host galaxy requires 
a description of the properties of both the AGN and host 
separately, and in relation to one another, across a wide 
range of intrinsic properties. However, it is observation- 
ally difficult to study the properties of both an AGN and 
its host galaxy. At high AGN luminosities (Mb < — 23, 
traditionally defined as the quasar regime), the contrast 
between the AGN and host galaxy is so great that the 
host cannot be easily discerned from the AGN. At low 
AGN luminosities, the AGN cannot be observed without 
significant contamination from the host unless very high 
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spatial resolution techniques are employed. 

Much effort has been applied to the development 
of techniques for the spatial i mage decomposit i on of 
AGNs and h qst galaxies (e.g. iKuhlbrodt et all 120041 
iMcLure et all 120001 iWadadekar et all Il999f >~ The re- 
sults from image decomposition studies generally show 
that AGNs are usually found in bulge-dominated galax- 
ies at the bright end of the luminosity function 
(e.g. [Smith et alJll986llpunk>p et al.ll2003l iFalomo et alJ 
l2004UFlovd et al]l2004jk this is especially true for ra- 
dio loud AGNs and AGNs with higher lu minosities 
(e.g. lHamilton et alJl2002T: iDunlop et al.ll2003[) . Multi- 
band image studies also show that the stellar pop- 
ulations of the host galaxies tend to be bluer, and 
therefore are likely to be younger, than inactive galax- 
ies with th e same morphological and luminosity char- 
acteristics dHutchings et alJ l2002t iSanchez et al.l 120041 



Jahn ke~et alJl2004al) . There is also evidence for merg- 
ers o r other interactions in some of the host galaxy im- 
TT ' '"" q ' Bahcall et alJll997l iMalkan et alJ 
120011 IMcLeod fc McLeod|T2001 

"HIT 



ages llHutchingslll987 
19981 iMarauez et al " 



Sanchez & Gonzalez Serrandl2OT3USanchez et al1l2004|k 
although it is not clear that host galaxies are more 
likely to be interacting than their inact ive counterparts 
l|Schade et al.ll2000l DrnTlop et alJl2003^ . 

Spectroscopic studies of AGN host galaxies have 
usually been confined to the outer parts of the 
galaxies, well away from the contaminating nu- 
clear r egions llBoroso n et alJ 119821 iBa lick fc Heckmanl 
Il983t iBoroson fc Okd 11984 IBoroson et al l [1985 1 
Hutching^TlT'Cr anipTonl 1199ft iNoTarT^ltHrtT 120011 
Mille r fc Sheinisl l2003j) . Some integrated-field spec- 
troscopy and other spectroscopic AGN removal tech- 
niques have also bee n performed on a small number of 
AGN host galaxies ijCourbin et alJ 120021 Uahnke et"aTl 
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2004 r j). These studies ha ve revealed both old 
llNolan et al]l200ll) and young <)Hutchings fc Cramptonl 
1990) stellar populations. Results supporting the inter- 
pretation of host galaxies as massive bulge-dominated 
systems with relatively you n g stel lar populations were 
found by iKauffmann et all l|2003f) . who analyzed the 
nuclear spectroscopic properties of over 20, 000 narrow- 
line (type 2) AGNs found in t he Sloan Digital Sky 
Survey (SDSS lYork et alJ l200Ch . They found that 
nearly all type 2 AGNs reside in massive galaxies with 
"early-type" structural properties, but that the age, 
starburst fraction, and ionization state of the AGN 
depends on the [Om] emission luminosity. Because 
type 2 AGN spectra conta i n littl e or no continuum com- 
ponent, IKauffmann et alJ (2003) were able to separate 
cleanly the stellar from emission line contributions to 
the spectra using pure stellar galaxy templates (they 
did not need to remove the broad-line components of 
AGNs). 

Recently, IHao et all l|2005a|) and IDong et alJ (j2005T) 

demonstrated that galaxy stellar and emission line spec- 
tral components can be separated in narrow-line and 
some broad-line AGNs by using galaxy eigenspectra and 
a power-law continuum for a possible AGN continuum 
component. The technique proved effective for isolat- 
ing the emission line components (from the galaxy, a 
possible AGN, or both) that were th en used to classif y 
the galaxy as star forming or AGN l|Hao et al J 12005a 1 ). 
Spectral principal compon ent analysis (PC A) has been 
performed on bot h galaxy ijYip et al.ll2004aj) and quasar 
ijYip et al1l2004b|) samples from the SDSS. These studies 
showed that galaxies and quasars can be classified based 
on only two or three eigencoefficients. At low luminosi- 
ties, the quasar second eigenspectrum has a strong galac- 
tic component resembling the first galaxy eigenspectrum. 
These results suggest that a combination of both galaxy 
and quasar eigenspectra may be effective in separating 
the components of composite AGN/galaxy spectra. 

In this paper we describe such a technique, and show 
that the galaxy and AGN components can be reliably dis- 
entangled over a wide range of galaxy types and galaxy to 
AGN flux fractions, even in spectra with modest signal- 
to-noise ratios. The technique has alr eady been ap- 
plied t o a small sample of SDSS AGNs bv lStrateva et alJ 
(2005), who used it to estimate AGN continuum lumi- 
nosities free of host galaxy components. Here we ap- 
ply the technique to over 11, 00 AGN spectra from th e 
SDSS third data release (DR3. lAbazaiian et alJl2005|) . 
The properties of the AGNs and host galaxies can be 
studied separately or in relation to each other. In addi- 
tion, the large sample size makes it possible to examine 
population statistics of broad-line AGNs and host galax- 
ies in ways not possible previously. 

The focus of this paper is on broad- line ("type 1") 
objects, which we will generically refer to as AGNs, re- 
gardless of luminosity. Quasars are defined as broad-line 
AGNs with luminosities brighter than M, = —22.0 (fol- 
lo wing the definition giv en in the SDSS quasar catalog 
bv lSchneider et alJl200ffl . 

The SDSS AGN survey and dataset are described in 
§ 2. The eigenspectrum spectral decomposition technique 
is described along with the results of simulation tests 
in § 3. The application of the technique to the SDSS 
AGN spectra is described in § 4, and some characteris- 



tics of the AGN and host populations are described in 
§5. The results are discussed and summarized in §6. 
Throughout the paper we assume a flat, A-dominatcd 
cosmology with parameter values f2 m = 0.3, = 0.7 
and H = 70kms _1 Mpc _1 . 

2. THE SDSS AGN SAMPLE 

2.1. The Main SDSS Quasar Survey 

The broad-line AGNs used for this study were se- 
lected from the SDSS. The SDSS is a project to im- 
age of order 10 4 deg 2 of sky, mainly in the northern 
Galactic cap, in f ive br oad photometric bands (it, g, r, i, z, 
l|Fukugita et all 119961) ') to a depth of r - 23, and 
to obtain spectra of 10 6 galaxies and 10 5 quasars se- 
lected from the imaging survey. Imaging observations 
are made with a dedicated 2.5m telescope at Apache 
Point Observator y in New Mexico, using a large mo- 
saic CCD camera (jGunn et al.l li~998') in a drift-scanning 
mode. Absolute astrometry for point sources is ac- 
curate to better than 100 milliarcseconds rms per co- 
ordinate ijPier et alJ [2003) . Site photometricity and 
extinction monitoring are carried out simultaneously 
with a dedicated 20-inch telescope at the o bserving site 
(Hogg ~Finkbeiner. Schlegel. fe Gunnll2001[) An assess- 
ment of the image data quality is given bv llvezic et alJ 
(2003). 

The imaging data are re duced and calibrat ed using 
the photo software pipeline ijLupton et al.ll200l|) . In this 
study we use three types of magnitudes, two derived from 
imaging data, and one from spectroscopic data. The 
first magnitude we use is the point-spread function (PSF) 
magnitude, which is derived from a Gaussian fit to the 
object and a series of aperture corrections using a spa- 
tially varying PSF model llStoughton et al.ll2002tK W e 
also use "cmodel" magnitudes l)Abazaiian et al.| [20041. 
which are derived from a linear combination of de Vau- 
couleurs and exponential profile fits to object images. 
The SDSS photometric system is normalized so that the 
u, <?, r, i, z magnitudes are approximately on the AB sys- 
tem (Oke & Gunn 1983; Fukugita et al. 1996; Smith et 
al. 2002; c.f. discussion by Abazajian et al. 2004). The 
photometric zeropoint calibration is accurate to better 
than 2% (root-mean-squared) in the g, r, and i bands, 
and to better than 3% in the u and z bands, mea- 
sured by comparing the photometry of objects in scan 
overlap regions. Spectrosc opic targets are select ed by 
a series of algorithms (see iStoughton et al J 120021 ) . and 
are grouped by thre e degree diameter areas or "tiles" 
(Blan ton et alJ l2003[) . Two fiber- fed double spectro- 
graphs can obtain 640 spectra for each tile; for the 
main survey each tile contains 32 sky fibers, and roughly 
500 galaxies, 50 quasars, and 50 stars. The wavelength 
range of the SDSS spectra covers approximately 3800A 
to 9200A at a spectroscopic resolution of about 1800. 
The AGN spectra used here were corrected for Galac- 
tic extinction using the reddening m ap constructed by 
iSchlegel. Finkbeiner. fc Davisl l|1998fl . and the average 
Milky Way extinction curve described by iFitzpatrickl 
(1999). "Spectroscopic magnitudes" are derived by con- 
volving the calibrated spectra with the SDSS g, r, and 
i filter transmission curves that include 1.3 airmasses of 
extinction. 

Quasar candidates are selected from the SDSS color 
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space and unr esolved matches to sources in th e FIRST 
radio catal og l)Becker. White. & Helfandl 119951k as de- 
scribed by iRichards et al.l l)2002|) . Quasars are also 
often identified because the objects were targeted for 
spectroscopy by non-quasar selection algorithms, such 
as op tical matches to ROSAT sources l)Stoughton et alJ 
1200a I Anderson et al J 1200^1 . various classes of stars 



so-called seren dipity objects 
and galaxies (jStrauss et al.l 



(IStoughton et al 
( Stought on et al 
2002). The images of quasar candidates selected as 
likely "low-redshift" (z < 3) objects, are allowed to 
be either resolved or unresolved. The SDSS photo 
pipeline classifies object images as resolved or unre- 
solved based on the difference betwe en PSF and cmodcl 
magnitud es llAb azaiian et al.l l2004|) (see also discus- 
sions by IStoughton et alJ 120021 IScranton et al.l 120021: 
iStrauss et alJl2002j) . It is important for this study that 
resolved objects can be quasar candidates, because a 
large fraction of AGN spectra for which a host compo- 
nent can be detected have extended image profiles (see 
§ 4.1). The completeness of the SDSS quasar selection al- 
gorithm is close to 95% up to the i band li miting survey 
magnitude of 19.1 l|Vanden Berk et al]l2005|K 

2.2. Selection of SDSS AGN Spectra 

The primary se t of quasars was s electe d from the cat- 
alog compiled bv ISchneider et all (|2005). The catalog 
contains 46, 420 quasars, and comprises what is believed 
to be nearly all of the verified quas ars in the SDSS third 
data release (lAbazaiian "et~aTf 2005h Quasars are defined 
in the catalog to be those extragalactic objects with ab- 
solute PSF i band magnitudes brighter than Mj = —22, 
and with at least one emission line having a FWHM 
larger than 1000 km s -1 . The absolute magnitudes were 
calculated from the observed PSF magnitudes without 
any attempt to subtract possible host galaxy contribu- 
tions. Near the faint luminosity limit of the catalog, 
it is apparent from inspection that many of the quasar 
spectra contain host galaxy components. The fraction 
of quasar spectra in the catalog with clear galaxy com- 
ponents decreases with increasing quasar luminosity (a 
point that will be quantified in §4.1). 

The limit of Mi = —22 (and most other traditional 
limits) for the definition of a quasar is fairly arbitrary, 
and broad-li ne AGNs are cl early present at fainter mag- 
nitudes fe.g. lHo et alJ fl997). It is expected that the host 
galaxy fraction is larger for the fainter AGNs. For this 
study we have extended the catalog of ISchneider et alJ 
( 2005) by simply removing the absolute magnitude cri- 
terion. The AG Ns were selected from the initial search 
list described by Schneider et al. (2005), by inspecting all 
of the extragalactic objects that were rejected from the 
quasar catalog because they did not meet the absolute 
magnitude limit. All of the AGNs were still required 
to pass the emission line width criterion. This search 
resulted in a sample of 3584 low-luminosity broad-line 
AGNs. 

We will refer to the combined sample of quasars and 
low-luminosity broad-line AGNs as the SDSS AGN sam- 
ple. The sample contains over 50, 000 broad-line AGNs. 
Host galaxy components will not be detectable in the 
majority of the AGN spectra. At redshifts beyond 0.752, 
the wavelength range of the galaxy eigenspectra covered 
by the SDSS spectra is too small for a reliable spectro- 



scopic reconstruction (§3.5). The initial data set there- 
fore consists only of those AGNs with z < 0.752. Spectra 
were also rejected for further analysis if more than 50% 
of their pixels inside either of two wavelength regions 
were flagg ed as potentially bad by the spectro pipeline 
( see IStoughton et alJ ((2002) for a list of flags). The two 
wavelength regions are 4160 < A < 4210A, where inte- 
grated flux densities are measured (§ 3.2), and the wave- 
length region at which an AGN spectrum and the galaxy 
and quasar eigenspectra overlap, which depends upon the 
rcdshift of the AGN. After the redshift and good pixel 
criteria are applied, the initial data set consists of 11, 647 
AGN spectra. 

3. EIGENSPECTRUM DECOMPOSITION OF AGN AND 
HOST GALAXY SPECTRA 

The technique we employ to separate the spectroscopic 
components of composite AGN and host galaxy spectra 
uses separate sets of quasar and galaxy eigenspectra in 
linear combination. One of the goals of eigenspectrum 
analysis (often also called principal component analysis, 
PCA, or Karhunen-Loeve transformation, KL) is to re- 
duce the complexity of a dataset (compression) by con- 
structing a (hopefully) small set of orthogonal eigenspec- 
tra that account for the bulk o f the va riati ons within 
a samp le of objects. lYip et all l)2004aj) and lYip et all 
( 2004bj) showed that the vast majority of the spectra of 
galaxies and quasars in the SDSS could be described by 
only a relatively small number (< 10) of eigenspectra. In 
this section we demonstrate that spectra with significant 
contributions from both an AGN and a host galaxy can 
be effectively decomposed using existing sets of quasar 
and galaxy eigenspectra. 

3.1. The Quasar and Galaxy Eigenspectra 

The eigenspectra em ployed here are t hose described 
and made available by lY^p^t^l J20 i 0^|) for the SDSS 
galaxy sample, and by lYip et all {2004b) for the SDSS 
quasar sample. The amount of information contained in 
each eigenspectrum is given by the relative amplitude of 
its eigenvalue. The galaxy sample variance is strongly 
concentrated in just the first few eigenspectra, with over 
98% of t he information con tained in the first three eigen- 
spectra l|Yin et aljl27)04a1h The quasar information is 
also concentrated, but not as strongly as in the galaxy 
case, with « 92% of the s ample variance acc ounted for by 
the first ten eigenspectra. lYip et all l)2004b|) showed that 
the resulting quasar eigenspectra depend on both redshift 
and luminosity, which causes a reduction in the informa- 
tion content of the top global eigenspectra (formed from 
the entire sample). Eigenspectra formed from subsets 
of quasars separated into bins of redshift and luminos- 
ity have a much more concentrated information content; 
meaning that fewer of these "local" eigenspectra are re- 
quired to describe a quasar within a restricted redshift 
and luminosity bin. It was also shown that eigenspec- 
tra from a given bin can be used to reliably reconstruct 
spectra in adjacent bins, allowing re dshift or luminosity 
trends to be tracked consistently l)Yip et alJl2004bl) . 

For our application, most of the AGN/galaxy spectra 
will be at redshifts z < 0.5, so we primarily use the 
set of quasar eigenspectra in the "ZBIN 1" lo w redshift 
bin, sp anning 0.08 < z < 0.53, defined by lYip et all 
l)2004b|) . It is also important that there is little or no host 
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galaxy contamination in the quasar eigenspectra, so that 
a galaxy component is reconstructed only by galaxy (and 
not quasar) eigenspectra. Therefore we use eigenspectra 
constructed from quasars in the "CI" hig h luminosity bi n 
(in the low redshift range), defined bv lYin et"aH {2P04b), 
which is sufficiently luminous that there is little evidence 
for any host galaxy component in the top eigenspectra; 
this bin spans an absolute magnitude range of —24 > 
Mi > -26. 

Galaxy or quasar spectra can be reconstructed as linear 
combinations of eigenspectra 



^a fe e fe (A), 
fc=i 



(1) 



where is the reconstructed flux density as a func- 
tion of wavelength A, and the ak are the eigencoefHcients 
of the corresponding wavelength dependent eigenspec- 
tra efc(A). Because the information content of the eigen- 
spectra is concentrated in the first several modes, a rela- 
tively small number of eigenspectra can be used to closely 
approximate a given spectrum, while at the same time 
suppressing much of the spectral noise, and interpolat- 
ing over small gaps in the spectra flConnqllv et al.lll995t 
IConnollv fc Szalavlll999UYip et al.ll2004aj) . 

Following Connollv et al. (f995), we define two angles, 
called classification angles, which are formed from the 
first three galaxy eigencoefficients. The first 4>, is the 
mixing angle of the first two eigencoefficients, a\ and ci2, 
of a galaxy spectrum 



4> = tan 1 (a2/ai). 



(2) 



The second angle 8, is formed from the third eigencoeffi- 
cient 03, of a galaxy spectrum 

6> = cos- 1 (a 3 ). (3) 

The eigencoefficients are normalized such that 



E 

fc=i 



1 . 



(4) 



where m is the total number of eigenspectra used for a 
reconstruction. It was shown bv IConnollv et all l)1995f) 
that the spectral type of a galaxy is tightly corre- 
lated with 0, whil e post-starburst activity can be dis- 
crimi nated with 9 l)Connollv et alJl995t lCastandc r et al.l 
|2001[) . The ability of the spectral decomposition method 
to accurately recover the galaxy classification angles from 
spectra with a wide range of c haracterist i cs is on e of the 
primary tests of the technique. lYip et aTl l|2004bf) showed 
that the first two eigencoefficients of quasar decomposi- 
tions can be used to classify quasars (although the clas- 
sification angle formed from the eigencoefficients was not 
explicitly defined). We define the classification angles for 
both host galaxies and quasars/ AGNs in the same way, 
according to equations[2JE] an d denote them with the 
subscripts H and A respectively: 4>h,9h,4>a, andf?^. 

To test the dependence of the spectral decomposition 
method on signal-to-noise ratio and host galaxy frac- 
tion, galaxy and quasar templates with known eigenco- 
efficients were modified in controlled ways and the effect 
on the resulting eigencoefficients were examined. The 
galaxy templates we used are the six h igh-S/N templates 
(labeled a through /) constructed bv lYip et al.l ((2004a) 
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Fig. 1. — The difference between the true gafaxy classification 
angles and best-fit angles as a function of the host galaxy frac- 
tion for noiseless input spectra. The dotted line shows the tf>H 
differences, and the solid line shows the 9u differences. The input 
spectra were constructed by combining the quasar template with 
the galaxy template from the —12° < <f>H < —8° range. 



from objects in specific regions of the projected classi- 
fication plane. These were constructed to correspond 
to the spectra of each normal galaxy typ e in th e atlas 
of nearby galaxies compiled bv iKennicuttl l)1992fl . Each 



of the templates was reconstructed using the first five 
galaxy eigenspectra to obtain the eigencoefficients. A 
quasar template was made for this study by selecting 
a representative spectrum from the CI quasar redshift- 
luminosity bin, and reconstructing it using the first 10 
quasar eigenspectra. The spectrum was selected to have 
a high S/N, and inspected to ensure that there was no 
indication of a host galaxy contribution. 

3.2. Recovery of Classification Angles from Noiseless 

Spectra 

One concern about the use of two separate sets of eigen- 
spectra is that they may not be orthogonal in combina- 
tion. That is, a composite host galaxy-quasar spectrum 
may not be uniquely described by a linear combination 
of the eigenspectr a from the two different bases. While 
lYip et all l)2004b|) showed that the second global quasar 
eigenspectrum resembles a galaxy spectrum, it still con- 
tains quasar features (e.g. some broad emission line com- 
ponents), and the full galaxy component may be spread 
across many eigenspectra. Therefore, there is no guaran- 
tee that the quasar and galaxy components of composite 
spectra can be reliably reconstructed using the eigenspec- 
trum method. 

The reliability of the spectral reconstruction of com- 
posite host-quasar spectra was first tested for the ideal 
case of noiseless spectra. Galaxy template d (from the 
range -12° < <j) H < -8° and 80° < H < 100°) was se- 
lected for this test because it is derived from the region 
with the highest density of sample galaxies. The recon- 
structed galaxy template was combined with the quasar 
template at various levels of fractional contribution. The 
fractional contribution of the host galaxy to the compos- 
ite spectrum Fh, is defined by the integrated flux densi- 
ties of the reconstructed quasar and galaxy components 
over the rest wavelength range 4160 < A < 4210A, 



F H = 



r-4210 n 
J4160 J- 



J4I6O (f\,A + d ^ 



(5) 
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Fig. 2. — Average reduced x 2 values for reconstructed spectra 
compared to template spectra for varying numbers of quasar and 
galaxy eigenspectra. The value of \ 2 ranges from 1.2 at Na = 1 
and Njj = 1, to 1.0 at Na = 10 and Njj = 5. The average reduced 
X 2 value for each point on the grid was calculated from fits to 1000 
simulated spectra. 




60 80 100 120 140 160 

6h, degrees 



Fig. 3. — The measured galaxy classification angles for the 
six noiseless templates, using Njj = 5 eigenspectra (circles), and 
Nh = 10 eigenspectra (x's). The boxes show the regions from 
which galaxy spectra were selected to form the templates. Labels 
show the general trend of galaxy type with <f>jj. 



The wavelength range was chosen because it avoids ma- 
jor galaxy stellar absorption lines as well as strong quasar 
emission lines, and it is covered by all of the SDSS spectra 
that can be usefully reconstructed. The noiseless spec- 
tra were fit with the combined sets of quasar and galaxy 
eigenspectra, using the first five galaxy and ten quasar 
eigenspectra, and the classification angles 4>h, Oh, 4>a and 
9a were calculated. (It is shown in § 3.3 that three galaxy 
and five quasar eigenspectra are sufficient for the recon- 
struction, and using more does not change the results 
significantly.) Figure H shows the difference between the 
true angles and best-fit angles as a function of the host 
galaxy fraction for the noiseless input spectra. It is ev- 
ident that there is no significant difference between the 
true and measured classification angles at any level of 
host galaxy contribution. Similar results were found for 
the quasar classification angles. The fitting technique re- 
liably reconstructs the galaxy and quasar components in 
the ideal case of noiseless spectra. 

3.3. The Number of Eigenspectra 

The number of quasar and galaxy eigenspectra to use 
in the fitting process is an important issue. With too few 
eigenspectra, it will not be possible to reconstruct all of 
the significant details in a spectrum. The use of too many 
eigenspectra can lead to "overfitting" a spectrum, so that 
eigenspectra of higher orders begin to fit noise features. 

The optimal number of eigenspectra will clearly de- 
pend upon the quality of a spectrum. In § 3.4 we discuss 
the dependence of the reconstructions on S/N and host 
galaxy fraction. Here we examine the number of eigen- 
spectra that should be used by fixing the S/N and host 
fraction at values typical of the spectra in the SDSS. The 
same galaxy and quasar templates used in the previous 
section were combined at a galaxy fraction of 30%, and 
Gaussian noise was added to achieve a S/N per pixel of 
10 (for SDSS spectra, this translates to a S/N per resolu- 
tion elemental « 12). The template was fit using varying 
numbers of both host galaxy and quasar eigenspectra, 
Nh and Na respectively, and the quality of each fit was 
measured by calculating the reduced \ 2 value of the fit. 
Figure[21 shows the reduced \ 2 values of the fits for each 



combination of the numbers of quasar and galaxy eigen- 
spectra. The value of % 2 ranges from 1.2 at Nh — 1 and 
N A = 1, to 1.0 at N H = 5 and N A = 10. The X 2 value 
drops significantly with increasing numbers of eigenspec- 
tra, until about three galaxy and five quasar eigenspec- 
tra, at which point the values do not change greatly with 
the addition of more eigenspectra. Therefore, for the 
template used in this test, there is no justification for 
using more than three galaxy and five quasar eigenspec- 
tra. For the AGNs in our sample, we have generally used 
five galaxy and ten quasar eigenspectra, which are more 
than needed in most cases, but not too many that the 
spectra are overfitted (the reduced \ 2 values do not drop 
significantly below unity). 

The result that only a relatively small number of eigen- 
spectra are required is not unexpected. The first three 
galaxy eigenspectra acc ount for over 98% of the variation 
in SDSS galaxy spectra <|Yin et al.l2004aft . and the first 5 
quasar eigenspectra in the low-redshift, high luminosity 
bin account for 99.8% of the SDSS quasar spectral vari- 
ation in that bin. More subtle variations accounted for 
by higher orders are often not apparent in spectra unless 
the S/N level is very high. 

While a small number of eigenspectra is sufficient for 
the most common galaxy types, a larger number will be 
necessary for rare galaxy t ypes, particularly those with 
very strong emission lines l|Yip et alJl200 4a1. To illus- 
trate this, Fig.|2| shows the measured classification an- 
gles for all six galaxy templates using both Nh = 5 and 
Nh = 10 galaxy eigenspectra. No noise or quasar compo- 
nent was added in this test. The measured classification 
angles are almost indistinguishable in the two cases for all 
five templates with 4>h > —40°. However there is a dif- 
ference of about 30° in 4>h and about 15° in 9h between 
the measurements for the extreme emission line galaxy 
template. The measurement made using the larger num- 
ber of eigenspectra is clearly better in that case. Because 
of this, in cases with an initial Nh = 5 measurement of 
4>h < —40°, the spectra used in this study were decom- 
posed a second time using Nh = 10 host eigenspectra. 

3.4. Dependence on Signal-to-Noise and Host Galaxy 
Fraction 
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The reliability of the component reconstruction clearly 
depends upon the S/N of a spectrum and the relative 
contribution of each component. To quantify this depen- 
dence, we tested the routine on simulated spectra gen- 
erated as described above but with varying amounts of 
noise and host galaxy contribution. For these tests, the 
quasar template was combined with the —12° < ipn < 
—8° galaxy template. The galaxy template fraction Fh 
was varied from near zero to near one. The spectroscopic 
noise level was varied to achieve a S/N per pixel range 
between 5 and 30. Random noise was added to each com- 
posite template spectrum according to the desired S/N 
level. The spectra were decomposed using Nh = 5 and 
Na = 10 eigenspectra, and the classification angles were 
measured. This was repeated 1000 times for each com- 
posite template spectrum, and the rms dispersion of the 
4> and 9 values were calculated for each combination of 
galaxy fraction and S/N. 

The results for the dispersion of the (f>H and Oh mea- 
surements are shown in Fig.^J As expected, the disper- 
sion decreases for increasing S/N and galaxy fraction. 
More interesting is the fact that even at modest S/N 
and galaxy fraction levels (say S/N = 10 and Fh = 0.3), 
the dispersions are only a few degrees for each angle. 
That is small enough to distinguish almost all of the 
galaxy templates from each other. Figure [S] shows the 
90% confidence ellipses (containing 90% of 1000 random 
realizations) in the 4>h — 9h plane for a range of values of 
Fh, for simulations using the —12° < cf>H < —8° galaxy 
template, with S/N — 10. The values of (f>H and 9h 
are clearly covariant, but the confidence ellipses shrink 
rapidly with increasing host galaxy fraction. There are 
also no systematic offsets in the values of the angles at 
any galaxy fraction. Finally, Fig. shows the measured 
values of (f>n and 9h for the 1000 simulations of all six 
templates with S/N — 10 and Fh — 0.3. (The points 
are not centered in the middles of the boxes because the 
distributio ns of real galaxies are not uniform across the 
boxes; see lYip et all ll2004bj) .) All of the galaxy tem- 
plates can be distinguished from each other, except for 
the two with 4>h > 5°. 

Similar results were found for the quasar reconstruc- 
tions. Figure[7| shows the dispersions of the <j)A and 0a 
measurements, as functions of both S/N and quasar frac- 
tion. The 4>a and 9a dispersions are roughly twice as 
large as the corresponding galaxy values. 

3.5. Dependence on Eigenspectrum Wavelength 
Coverage 

The eigenspectra cover the wavelength ranges spanned 
by the spectra used to construct them. For the eigen- 
spectra used here, the rest frame wavelength range is 
effectively 3500 — 7000A. However, as redshift increases, 
a decreasing fraction of the eigenspectrum wavelength 
range can be used, because rest frame wavelengths are 
shifted out of the observed range. The redshift at which 
there is no more usable eigenspectrum wavelength cover- 
age is clearly an upper limit on the technique; however, 
a practical limit is reached at lower redshifts as the relia- 
bility of the reconstructed spectra decreases with shorter 
wavelength coverage. 

Figure[S] shows the rms dispersion of the galaxy classi- 
fication angles measured for a large number of simulated 
spectra, as a function of the fraction of the total eigen- 



spectrum wavelength covered. The host galaxy fraction 
and S/N level were held constant in all of the simula- 
tions. As expected, the dispersions increase with shorter 
wavelength ranges. For our purposes, the dispersions be- 
come unacceptably large at a wavelength coverage frac- 
tion of about 0.5. At that point, the dispersions are com- 
parable to those in cases with low S/N and small galaxy 
fraction. In SDSS spectra, the redshift corresponding to 
a coverage fraction of 0.5 is zu m it — 0.752, which is what 
we adopt as the upper limit for the sample selection. 

The redshift limitation is not a limitation of the tech- 
nique in general, but applies to the specific case of the 
eigenspectra used here. It is also restricted to the galaxy 
eigenspectra, since the quasar eigenspectra cover a much 
wider wavelength range. The limitation could be over- 
come by constructing galaxy eigenspectra that cover 
a wider wavelength range, especially at shorter wave- 
lengths. This requires UV spectra of low-redshift galax- 
ies, or optical spectra of galaxies at higher redshifts. 

3.6. Example Reconstructions 

Examples of fits to real spectra are shown in Fig.|5J 
The spectra were drawn from the SDSS dataset (see 
§2), and were selected from each of the six regions for 
which the template galaxy spectra were created. The 
S/N ranges from 10 to 50 and the host galaxy fractions 
vary from 0.25 to 0.71. The original spectrum, the eigen- 
spectrum reconstruction, the residual spectrum the AGN 
component, and the host galaxy component are shown 
for each object. It is evident from the figures that the 
spectra are well fit by the combined eigenspectrum sets. 
Larger residuals occur in the regions containing strong 
narrow emission lines, as expected, because a larger num- 
ber of eigenspectra is often required to describe narrow 
line features. 

4. SDSS HOST GALAXY AND AGN SPECTRA 

4.1. Decomposition of SDSS AGN Spectra 

The eigenspectrum decomposition method was applied 
to all 11,647 AGN spectra in the initial sample. The 
fits to some of the spectra resulted in components that 
extend below zero flux density in continuum regions. In- 
spection of those cases showed that the AGN compo- 
nent dominates enough that the galaxy component is 
too small to be reliably estimated, that is, the galaxy 
fraction is usually well below 0.1 — the practical limit 
discussed in § 3.4. Fits to spectra with S/N ratios less 
than about 10 are also unreliable (§3.4), so those spec- 
tra are removed for further analysis. In Fig. llOl the frac- 
tion of AGNs with S/N > 10 having a host galaxy frac- 
tion exceeding Fh = 0.1 is shown as a function of the 
AGN absolute i band magnitude. As expected, the frac- 
tion of AGNs with detectable host galaxy components 
drops with increasing luminosity. The detected fraction 
falls with luminosity in much the same way as the frac- 
tion with extended profiles in the imaging data fFig. HOfl . 
There is a close correspondence between the classification 
of an AGN image as extended, and the AGN having a 
spectroscopic host fraction Fh > 0.1. Of the AGN with 
z < 0.752, 52% are extended, and 61% have F H > 0.1. 
Of the AGN with extended images, 89% have F H > 0.1 
compared with only 30% in the AGN sample with unre- 
solved images. 
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Fig. 5. — Confidence contours encircling 90% of the results of 
reconstructions to the simulated spectra, for varying values of Fh- 
The host galaxy fraction in the simulated spectra was varied from 
0.1 to 0.9 in steps of 0.1, and the S/N level was fixed to be 10. The 
outer three contours are labeled by the values of Fu in the simu- 
lations. The angle (j>jj is generally correlated with star formation 
rate, while 9jj appears to be related to post-starburst activity. 
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Fig. 6. — The classification angles for the six host galaxy tem- 
plates with 1000 simulations of added noise. The S/N level of 
the simulated spectra was fixed at 10, and the host galaxy frac- 
tion was fixed a t Fh = 0.3. The boxes show the regions from 
which Yip ct al. ( 2004aj) selected galaxy spectra to construct the 
templates. Labels show the general trend of galaxy type with 4>h- 



Figure ^] shows the i band cmodel (total flux) ab- 
solute magnitude vs. redshift for the initial sample of 
7664 SDSS AGN with S/N > 10; squares represent the 
4666 AGNs for which there is a "reliably" detected host 
galaxy. At redshifts greater than about 0.4, the hosts 
of less luminous AGNs are often undetected because the 
S/N is low, and they are undetected in high luminosity 
AGNs because the host component fraction is too low. 
For most of the analysis that follows, we will omit AGNs 
with a host component fraction less than Fjj = 0.1, or a 
median spectroscopic S/N per pixel in the i band wave- 
length region of less than 10. 

The galaxy classification angles for the fits to the 4666 
"reliably" detected host galaxies are shown in Fig.1121 
The distribution of host galaxy angles is clustered in a 
relatively small region, as are tho se of the normal galax- 
ies analyzed bv lYip et alJ (|2004a), which are shown with 
density contours in the figure. Labels show the general 
trends of star formation and post-starburst activity with 
the classification angles; however, the labels are meant 



only to suggest the correlations, but not that there is a 
strict one-to-one correspondence. A more detailed com- 
parison of the distribution relative to normal galaxies is 
discussed in § 5. 

Information for the objects used in this study is tab- 
ulated in an electronic table, the column description for 
which is given Tabled The information is described 
in the electronic version of Tabled which includes all 
(11,647 total) SDSS broad-line AGNs with redshifts z < 
0.752 for which an eigenspectrum reconstruction was at- 
tempted. The table includes the SDSS coordinate name, 
coordinates, redshift, plate number, fiber number, spec- 
troscopic MJD, imaging morphology, spectroscopic S/N 
in the i band, magnitudes, luminosities (described in 
§4.2), host galaxy fraction (Fh), and the AGN and host 
galaxy classification angles. Default values (set to zeros) 
of the reconstructed spectral parameters are given if ei- 
ther the quasar or host galaxy reconstructed spectra had 
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Fig. 7. — The rms dispersion, in degrees, on the measurement of <f>A (left) and 8a (right) for simulated spectra with varying levels of 
S/N and host galaxy contribution fraction. The contours are labeled by the value of the dispersion. 



40 




0.0 0.2 0.4 0.6 0.8 1.0 

Wavelength coverage fraction 

Fig. 8. — The rms dispersion, in degrees, on the measurement of 
</>h and 6h for simulated spectra with varying fractions of eigen- 
spectrum wavelength coverage. The host galaxy luminosity frac- 
tion and S/N ratio were held constant, and noise was added to the 
template spectra according to the S/N level. 

negative flux densities in the 4160 — 4210A continuum 
region. The objects are listed regardless of S/N or host 
galaxy fraction, Fjj; users of the dataset can therefore 
apply any range of selection cuts to the sample. 

4.2. Luminosity Determination 

Estimates of the luminosities of the host galaxy and 
AGN components were made by measuring rest frame ab- 
solute magnitudes in the SDSS g and r passbands. Nor- 
mally, absolute magnitudes are determined by measur- 
ing the apparent magnitude of an object in an observed 
frame passband, then applying the distance modulus (us- 
ing a specific cosmology) and a K-correction, which ac- 
counts for the difference between the observed frame and 
rest frame flux density covered by a given passband. The 
wavelength range of the eigenspectra cover the SDSS g 
and r bands completely, so synthetic magnitudes can be 
constructed without the need for K-corrections. The se- 
lection of the SDSS g and r filters also allows for direct 
comparison between the results here an d those of other 
SPSS ga laxy studies, such as the ones bv lBernardi et al.1 
i2003clbft and lHogg et all ll20T)l . 



The rest frame apparent magnitudes were measured di- 
rectly from the reconstructed spectra by convolving the 
spectra with the SDSS filter transmission curves 8 , includ- 
ing 1.3 airmasses of extinction, after shifting the spectra 
to the rest frame and multiplying the flux densities by the 
rcdshift expansion factor, 1 + z. The distance modulus 
(as a function of redshift and the selected cosmology) was 
applied to the apparent magnitudes to determine the ab- 
solute magnitudes; K-corrections are not necessary. The 
absolute magnitudes of the reconstructed AGN compo- 
nent, the reconstructed host galaxy component, and the 
total spectrum of each object were determined using the 
spectroscopic convolution technique. 

The determination of the absolute magnitudes using 
the spectroscopic convolution technique is possible due to 
the high quality of the SDSS spectrophotometry 9 . The 
airmasses of the spectroscopic observations vary from 
plate to plate, so slight improvements could perhaps be 
made by adjusting the airmass of the extinction applied 
to the transmission curves. Additionally, small uncer- 
tainties could be introduced if fibers are not precisely 
centered on the active nuclei. However, a direct compar- 
ison between the stellar imaging and spectroscopic mag- 
nitudes for tens of thousands of stars in the SDSS DR3 
sample, shows an rms variation of less than 0.06 magni- 
tudes in the <?, r, and i bands at a spectroscopic S/N 
of 10. That is significantly less than the uncertainty ex- 
pected to be introduced by quasar variability. We expect 
the spectrophotomctric accuracy to be somewhat lower 
than 0.06 magnitudes for the AGN and host components 
separately, due to the uncertainties of the eigenspectrum 
reconstruction technique (§3). 

Before using the magnitudes to study the properties of 
the objects, a correction must be applied to account for 
the finite angular width of the SDSS spectroscopic fibers, 
which subtend only a 3" diameter on the sky. Not all of 
the collected light from an active nucleus or host galaxy 
will be covered by a fiber, and the fraction of the total 
intensity entering the fibers will depend on the sizes of 

8 http:/ /www. sdss.org/dr3/instruments/imagcr/ 

9 http: / / www.sdss.org/dr3 / products / spectra/spectrophotometry.html 
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Fig. 9. — Examples of eigenspectrum reconstructions of AGN/host galaxy spectra. Each panel shows the original spectrum (black), the 
reconstructed spectrum (red), the residual spectrum smoothed by 7 pixels (black, near fx = 0), the AGN component (blue), and the host 
galaxy component (green). The six examples were drawn from the <f>H and djj regions, defined by the coordinates inside panel, from which 
the galaxy template spectra were constructed. The estimated morphological type of galaxies inside each region are given. The emission 
lines of the spectra in the lower-right panel extend well-beyond the top limit of the plot. 



the images. An AGN component is spatially unresolved, 
while a host galaxy may be extended, so the correction 
factor applied to account for the finite fiber diameter will 
be different for each component. In addition, varying 
seeing conditions can affect the amount of light entering 
the spectroscopic fibers, even from unresolved sources. 

The offsets needed to correct the (unresolved) AGN 
spectroscopic magnitudes were determined by compar- 
ing the spectroscopic magnitudes of non-variable early- 
type stars observed in the SDSS spectroscopic survey to 
their so-c alled "cmodel" magnitu des. The cmodel magni- 
tudes m c ijAbazaiian et al.l2004|) . which are derived from 
model profile fits to the images, are designed to include 
nearly all of the flux of both point sources and extended 
galaxies. The mean observed frame magnitude offsets in 
the g, r, and i bands were determined for each spectro- 



scopic plate separately, to account for varying observing 
conditions. Each plate corresponds to observations of 
about 25 usable stars. The mean offsets were found by 
weighting each star by the square of its spectroscopic 
S/N, although the weights did not usually significantly 
affect the results. 

In the analysis that follows, the magnitude offsets AgA 
and ArA, were applied to each set of rest frame AGN 
absolute g and r magnitudes. Because the g, r, and i 
offsets were determined in the observed frame, the rest 
frame g and r corrections applied to the AGN magnitudes 
were interpolated from the two closest observed frame 
offsets, at the redshifted central wavelengths of the g and 
r bands. Interpolation caused only minor corrections 
to the magnitude offsets, because the mean differences 
among the offsets of the three passbands are only about 
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Fig. 10. — The fraction of AGN spectra with S/N > 10 having a 
detectable host galaxy, i.e. a galaxy fraction greater than Fb = 0.1, 
as a function of absolute i band cmodel magnitude, M%. Also shown 
is the fraction of AGNs in the sample that have extended image 
morphology as a function of Mi. 



u 
-i 

+^ 

l 
□ 



o 
m 
12 
< 




0.4 

Redshift, z 

Fig. 11. — The absolute i band cmodel magnitude, Mi, vs. red- 
shift for the SDSS AGN sample with S/N > 10, up to a redshift 
of z = 0.752. Dots represent AGNs for which a host galaxy was 
reliably detected; that is the spectra have a host galaxy fraction 
Fjj > 0.1. Contours represent the region occupied by AGNs with- 
out a reliably detected host galaxy. There are 7664 AGNs in the 
S/N > 10 sample, 4666 of which have a detected host galaxy. The 
AGNs with detectable hosts are generally at lower luminosities and 
redshifts. 



0.02 magnitudes. This is much smaller than the uncer- 
tainties between the spectroscopic and imaging epochs 
introduced by AGN variabil ity, which are typically a 
few tenths of a magnitude l|Vanden Berk et alJ H)04) , 
and become relevant when correcting the galaxy magni- 
tudes. There are contributions to the uncertainty in the 
AGN magnitude corrections from photometry, the spec- 
troscopic decomposition, and varying observing condi- 
tions. The rms dispersion of the spectroscopic to cmodel 
magnitudes for stars (noted above), places a lower limit 



Fig. 12. — The galaxy classification angles (points), <j>H an d ®Bi 
found for SDSS broad-line AGN spectra with z < 0.752, S/N > 10, 
and a host galaxy fraction Fjj > 0.1. There are 4666 AGNs in the 
sample. The contours show the distribution of the classifi cation an- 
gles for non-active galaxies from the sample described bv lYip et alJ 
( 2004a); each contour level differs by a factor of two relative to ad- 
jacent levels. Labels show the general trends of star formation rate 
and post-starburst activity with the classification angles. 



on the uncertainty of 0.06 magnitudes. We estimate the 
typical total uncertainty to be up to a few tenths of a 
magnitude. The corrected AGN absolute magnitudes 
M g ,A and M r< A, and the AgA and ArA values applied 
to the AGN spectroscopic absolute magnitudes are given 
in the electronic version of Tabled Default values (set 
to zeros) are given for the quantities if the spectroscopic 
decomposition was non-physical (see §4.1). 

A significantly larger fraction of the host flux is ex- 
pected to fall outside the fiber area, compared to the 
AGN flux. The same method used to correct the AGN 
flux cannot be used to correct the host galaxy compo- 
nent flux, because host galaxy sizes and profiles can vary 
greatly from object to object. Instead, we estimate the 
total host galaxy flux by subtracting the corrected AGN 
flux from the total photometric cmodel flux of the AGN 
plus host galaxy. In practice, the magnitude offset for a 
host galaxy, Amj, was found by estimating the ratio of 
the cmodel (subscript c) and spectroscopic (subscript s) 
fluxes of the host galaxy 



Am H = m^ H - m s ^ H (6) 
= -2.51og(/ CiH // s ^). (7) 

The host galaxy cmodel to spectroscopic flux ratio is a 
measure of the amount of host galaxy flux lost outside 
a spectroscopic fiber. The ratio cannot be measured di- 
rectly, because of the contribution of the AGN compo- 
nent, but it can be estimated by subtracting the estimate 
of the AGN flux 
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where the subscript T refers to the total of the AGN 
(subscript A) and host (subscript H) quantities, and the 
cmodel to spectroscopic flux ratios are 

S T = fc,T/fs,T (11) 
Sa = fc,A/ fs,A- (12) 

The quantity F m u is the spectroscopic host contribution 
fraction, similar to Fjj defined by Eq.JS] but using the 
flux in the rest frame g or r bands. The result for the 
host galaxy magnitude correction is 

Am H = -2.51og(5 T F-^ - S A {F-] H - 1)) . (13) 

The magnitude offsets were applied to the host galaxy 
absolute magnitudes in the analysis of the following sec- 
tions. The cmodel to spectroscopic flux ratios, Sa and 
St, can only be determined in the observed frame, so 
the values used in Ea . 11 31 were interpolated from the ob- 
served frame to the redshifted wavelengths of the g and 
r bands. While the corrected magnitudes should give 
an accurate estimate of the luminosities for the sample 
as a whole, the values for individual cases may be less 
reliable due to measurement uncertainties, unaccounted 
for differences in observing conditions, AGN variability, 
and inaccuracies in the spectral reconstruction. The typ- 
ical uncertainty from those sources is estimated to be a 
few tenths of a magnitude, but larger than the uncer- 
tainty for the AGN measurements. In a small number 
of cases, the "corrections" make the total host luminosi- 
ties fainter than the spectroscopic luminosities. The re- 
sult of the uncertainties is to broaden the distribution 
of luminosities. In any case, the corrected host galaxy 
absolute magnitudes are listed in the electronic version 
of Tabled along with the magnitude offsets Agn and 
Arii that were applied to the spectroscopic magnitudes, 
so that the results may be compared between the spec- 
troscopic and corrected luminosities. Default values (set 
to zeros) are given for the quantities if the spectroscopic 
decomposition was non-physical (see §4.1). 

5. RESULTS 

5.1. Comparison of AGN and Host Galaxy Luminosities 

It was shown in § 4.1 that the fraction of AGNs with a 
detectable host galaxy component (Fji > 0.1) decreases 
with AGN luminosity. Fig. El shows the host galaxy ab- 
solute magnitudes vs. the AGN absolute magnitudes for 
all of the objects with a spectroscopic S/N > 10. The 
AGN and galaxy luminosities appear to be closely corre- 
lated in both the g and r bands. This is due in part to the 
increased difficulty of detecting a host galaxy as its frac- 
tional contribution to the spectra decreases. The dashed 
line shows where the spectroscopic host luminosity con- 
tributes 10% to the total luminosity, adjusted for the 
average magnitude changes to the AGN and host com- 
ponents (§4.2), and corresponds closely to the galaxy 
fraction reliability limit, Fh — 0.1 discussed in §3.4. 
The selection function, that is, the fraction of spectra in 
which a host galaxy could be detected, decreases toward 
that limit. Points may extend beyond the limit (and 
a few do), depending upon the spectroscopic to cmodel 
magnitude corrections described in §4.2. A correlation 
between the host and AGN luminosities cannot be es- 
tablished without a more careful analysis of the selection 
function (which we defer to future work). However, if 



the (unplotted) AGNs with hosts outside of the detec- 
tion limits radiate at less than the Eddington limit (see 
below), there would be a very strong correlation between 
the AGN and host galaxy luminosities. 

Also shown in Fig.^D ar e the evolution corrected 
characteristic absolute magnitudes for early type galax- 
ies (without broad emission line compo nents), M„., = 
—20.70 and M r ,* = —21.35, as found bv iBernardi et alJ 
(2003c) for the median redshift of our sample, (z) = 
0.236. The median values of the host absolute magni- 
tudes are (M gM ) = -22.03 and (M r>H ) = -22.62, which 
are clearly more luminous than typical elliptical galaxies. 
There is also a wide distribution in the host luminosities 
at every AGN luminosity. 

A number of image decomposition studies have found 
that there is a lower threshold to the galaxy luminosity 
for a given AGN luminosity. This host- AGN luminosity 
threshold is thought to arise from scaling relationships 
among bulge luminosity, bulge mass, bla ck hole mass, 
and Eddington luminosity. For example, Dunlop et al. 
( 2003) found that quasars are typically radiating at 1- 
10% of their Eddington luminosities, which in effect es- 
tablishes a scaling relation between AGN luminosity and 
black hole mass. As there is also a correlation between 
black hole mass and bulge mass and luminosity, one ex- 
pects to find a scaling relation between AGN and host 
galaxy luminosity. Thus, a given AGN luminosity re- 
quires a host galaxy that is sufficiently massive and lu- 
minous to harbor a black hole of the mass inferred from 
the Eddington ratio. 

We have plotted the expected relationships between 
AGN and host luminosities for the r band in Fig. 1131 
for Eddington luminosity ratios LA/LEdd — 1.0,0.1, 
and 0.01. The relationship between galaxy luminosity 
and black hole mass was estimated by assuming the r 
band b ulge mass-to-light ratio found bv IBernardi et all 
( 2003 d), a black ho le mass to bulge mass of 0.0012 (e.g. 
McL ure &: Dunlo'p|l2002[l . and assuming that the host lu- 
minosity is due entirely to a bulge component. The re- 
lationship between black hole mass and AGN luminosity 
was estimated by assuming a 10% accretion radiation 
efficiency for the bolomctric Eddington luminosity, and 
using the V band to bolometric correction factor given by 
lElvis et all l)1994fl (which should be close to the r band 
factor within the uncertainties). Equating the black hole 
masses inferred from the galaxy luminosity and AGN lu- 
minosity gives the relationship between the two lumi- 
nosities as a function of the Eddington ratio. For most 
spectra, the sensitivity limits of our study do not extend 
to an Eddington ratio of unity, but only to L A / ' i^Edd of 
a few tenths. The vast majority of our AGNs appear to 
be radiating at greater than O.QlLEdd- It is possible that 
the sample is missing low-luminosity AGNs with broad 
emission lines that are too weak to be detected in the 
SDSS spectra. Those objects could have LA/LEdd much 
smaller than 0.01, and therefore could populate the up- 
per left regions of the plots in Fig.^3 Until such objects 
can be confirmed or ruled out, we cannot conclude that 
there is a lower limit to LA/LEdd- 

Because we have spectroscopic information for the 
AGN components, it will be possible in many cases to 
derive estimates for the black hole masses based on the 
AGN broad emission line widths and the relationship be- 
tween broad emission line region size and luminosity; we 
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Fig. 13. — Absolute g band (left) and r band (right) magnitudes of the AGN and host galaxy components, Mff and Mjy respectively. 
The dashed diagonal lines marked Fjj = 0.1 correspond to a host galaxy fraction of 0.1 in each band, adjusted for the average correction 
from spectroscopic to cmodel magnitudes, and closely mark the sensitivity limits for the detection of host galaxies. The horizontal lines 
show the evolution corrected characteristic magnitude of early type galaxies without broad-line AGNs, M t . The solid diagonal lines in the 
r band plot show the host luminosity at which an AGN would be expected to radiate at 0.01, 0.1, and 1.0 times the Eddington luminosity, 
LEdd- 
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Fig. 14. — Rest frame host galaxy color vs. host galaxy absolute 
magnitude in the r band, M r jj. The median value of the color 
in bins of 0.2 magnitudes is shown along with the 68.3% confi- 
dence limits. The dashed line shows the color-absolute magnitu de 
relation for early type galaxies found by Bcrnardi ct al. (2003c). 



leave these issues to be addressed in future work. In any 
case, we can conclude that AGNs may reside in hosts 
with a wide range of luminosities, but that in general 
the hosts of AGNs with M < — 18 are significantly more 
luminous than galaxies without strong broad line AGNs. 

5.2. Host Galaxy Colors 

The host galaxy rest frame (g — r) color is shown as 
a function of host absolute magnitude, M r> H> m Fig.1141 
For clarity, the mean colors and 68.3% confidence enve- 



lope in bins 0.2 magnitudes wide are also shown, con- 
nected by lines. There appears to be a negative correla- 
tion of host color with luminosity, such that the hosts 
become bluer at higher luminosities. There is also a 
fairly wide distribution of colors at any given luminos- 
ity. Aside from measurement uncertainties, it is possible 
that the the wide distribution of colors and the luminos- 
ity trend are due to a wide distribution of galaxy types 
that changes with lu minosity, because the colors of galax- 
ies differ by type (cf. IHogg et al.l l:2004) . The straight line 
in Fig. ll4l shows th e color-luminosity relat ion for elliptical 
galaxies found bv IBernardi et alJ |2003a), which is sim- 
ilar tc^_the_jdation_for bulge-dominated galaxies found 
bv IHogg et alJ l)2004[) . The range of host galaxy colors 
is much wider than the dispersion around the elliptical 
color- luminosity relation, w hich is less than 0.1 magni- 
tude ijBernardi et al.ll20 03a) . The color line for elliptical 
galaxies describes the host galaxy colors reasonably well 
at low luminosities (M r> H > —21.5), but it becomes in- 
creasingly redder than the majority of the hosts at high 
luminosity. This implies that if host galaxies are mainly 
ellipticals or otherwise bulge-dominated, for a given lumi- 
nosity they are bluer than the same types of galaxies that 
do not contain an observed AGN. The results may in- 
stead mean that the types of galaxies that contain AGNs 
change with luminosity, such that disk-dominated hosts 
become more common at higher luminosity. However, 
not only do most image decomposition studies find that 
the majority of host galaxies are bulge-dominated, but it 
also appears that bulge-do minated hosts are m ore com- 
mon at higher luminosities (DunloD et al.ll2003(l . Multi- 
band photometric imaging studies also support the idea 
that the hos t galaxies are bluer than their non-active 
counterparts j Hutchings et all200"l iSanchez et al.ll2004t 
Uahnke et al1l2004a|) . 
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Fig. 16. — Distribution of the host galaxy (fin angle values for 
spectra with S/N > 10 and Fjj > 0.1 (solid histogram). The 
dashed histogram s hows th e 4>h distribution for normal galaxies, 
found bv lYip et aT] I2004al) . 



Fig. 15. — Host galaxy rest frame g — r color vs. 4>H angle values 
for spectra with S/N > 10 and F H > 0.1. 



iBernardi et alJ l)2003aj) showed that the color luminos- 
ity relation for early type galaxies is a consequence of a 
strong dependence of color on stellar velocity dispersion. 
Based on this, we would expect the redder hosts in our 
sample at a given luminosity to have a higher velocity 
dispersion, and consequently a larger black hole mass. 
However, if the star formation rate is related to the pres- 
ence of an AGN, that prediction may not hold. There 
is evidence from the eigcncocfhcicnts, described in § 5.3, 
that the hosts are dominated by post-starburst galaxies, 
which would likely change both the color-velocity dis- 
persion and color-luminosity relation from those of non- 
active ellipticals. Further analysis in the future of the 
spectroscopic components to estimate velocity dispersion 
and black hole mass should help shed light on these is- 
sues. 

The nuclear colors of an AGN are expected to change 
through time in a num ber of possible evolutionary se- 
quences. For example iStorchi-Bergmann et all (2001) 
describe a scenario in which a merger event would both 
fuel AGN activity, and trigger circumnuclear star forma- 
tion early in the process. Dust associated with the star 
formation may reprocess the nuclear light, making it ap- 
pear red, so that in this early phase the galaxy would 
be classified as an ultraluminous infrared gala xy (e.g. 
ICid Fernandes et alJl200ll iBushouse et alJl2002|) . With 
time, dust is removed and the starburst fades, so that an 
unobscured AGN combined with an older stellar popu- 
lation dominate the nucleus. The objects in our study 
may be in such a later evolutionary phase, as long as the 
aged nuclear stellar population is younger than it would 
have been in the absence of the triggering event. 

5.3. Host Galaxy Spectroscopic Classification 

Further information about the classification of the host 
galaxies and the AGNs is contained in the eigencoef- 
ficients. The relative strengths of the eigencoemcients 
contain information about the physical properties of the 
galaxy and AGN components. While there has yet been 
no quantitative calibration of the eigenspectra with phys- 



ical parameters, there are clear qualitative correlations. 
The first few AGN eigenspectra are related to the so- 
called "eigenvectors 1 and 2" in observed parameter 
space as defined bv iBoroson fc Greenl (^)92), and it is 
believed that the eigenvect ors are indicato rs of black hole 
mass and accretion rate (Boroson 2002). The galaxy 
classification angles, 4>h and Ojj, are related to star for- 
mation rate an d the presence of post-starburst activ- 
ity respectively jYip et al.ll2004al iConnollv et~al"]ll995t 
iGastander et alJ 120011) . FigureEl shows how the g — r 
host galaxy colors, discussed in §§5.1 & 5.2, are related 
to the first galaxy classification angle, 4>h- The angle 
4>h and the host color are correlated, as expected given 
the qualitative relation between (f>H and star formation 
rates. 

Fig-El shows the distribution of 4>h for all of the re- 
liably detected host galaxies with Fh > 0.1 and spec- 
troscopic S/N > 10. Also shown is t he 4>h dis t ributio n 
for the normal galaxies analyzed by lYip et all (|2004a) . 
There are clear differences in the distributions. The host 
galaxy distribution is shifted to slightly more negative 
(bluer color) values, and there is a much less pronounced 
(and blucward shifted) narrow peak at 4>h ~ 5° com- 
pared to the inactive galaxies. The distribution of the 
host (fin values probably indicates that the host star 
formation rates are generally higher than for galaxies 
without AGNs. That would be true especially if the 
morphology distribution of the hosts relative to other 
galaxies is weighted toward ellipti cals, as indicated in 
a number of imaging studies (e.g. i Jahnke et al.ll2 004a ; 
i Sanchez et"aTll2004t lHamilton et alJl2002t iPercival et alJ 
l20mHScnade et a ,Ul200q>. 

There is a much more dramatic difference in the dis- 
tributions of the second galaxy classification angle, 0h, 
shown in Fig. 1171 The inactive galaxy distribution is 
strongly peaked at 9h ~ 89°, while the host galaxy dis- 
tribution is not only shifted to higher angles (a peak at 
6h ~ 93°), but is much wider, and extends to angles at 
which there are virtually no inactive galaxies. A possible 
interpretation is that the population of AGN hosts con- 
tains a relatively large fraction of post-starburst galaxies. 
Post-starburst (also called E+A) galaxies are character- 
ized by Balmer jumps and high-order Balmer absorption 
lines, indicative of stellar populations of order 100 Myr 
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Fig. 17. — Distribution of the host galaxy Oh angle values for 
spectra with S/N > 10 and Fh > 0.1 (solid histogram). The 
dashed histogram s hows th e <j>H distribution for normal galaxies, 
found bv lYip et aT] I2004al) . 



old, but with little or no narrow emission lines that woul d 
imply current star formation l)Dressler fc Guni] ll983). 
The third galaxy eigenspectrum has properties si milar to 
those of post-starburst galaxies ijYip et al.l2004al) : a blue 
continuum and strong Balmer absorption features, im- 
plying a relatively young stellar population, but also fea- 
tures anticorrelated with the continuum which reduce the 
narrow emission lines, and therefore indicate suppressed 
star formation. Post-starburst activity may therefore 
be indicated by Oh which is a measure of the relative 
strength of the third galaxy eigenspectrum. A strong 
third eigenspectrum does not mean that there is no cur- 
rent star formation in a galaxy, nor that the host galaxy 
can be classified as a post-starburst, only that there is 
a component of the galaxy spectrum that resembles a 
post-starburst galaxy. 

Spectacular examples of post-starburst AGNs (broad- 
line AGNs associated with a p ost-starburst host ) have 
been studied in some detail llBrotherton et all [l999; 
iCanalizo et"aLll2000l iBrotherton et alJl2002j) . In a study 
of so-called "transition" AGNs — objects with character- 
ist ics of both AGNs and ultra-lu minous infrared galaxies 
- ICanalizo fc Stockton! l)2001|) found nearly all of the 
host galaxies had post-starburst spectra. Post-starburst 
AGNs virtually always show morphological evidence for 
recent mergers or other interactions; however, the evo- 
lutionary stages of the AGNs and the relation to the 
post-starburst activity is not clear. If Oh is an indica- 
tor of post-starburst activity which is somehow linked to 
triggering AGNs, the link may be reflected in a relation 
between Oh and black hole mass, as the black holes in 
young AGNs would be expected to be less massive than 
in older AGNs for a given accretion rate. 

Assuming the eigenspectra are physically meaningful, 
connections among the AGN and host components may 
be evident in the eigencoefficients. The AGN and host 
galaxy classification angles are plotted against each other 
in Fig.^l The points have been restricted to objects for 
which Fh > 0.3 and S/N > 10, in order to include only 
very reliable estimates of the angles. There are signif- 
icant correlations among all of the classification angles, 
except for <Ph and a with a Spearman rank correlation 
coefficient of —0.019 and a probability of about 32% for 
2678 degrees of freedom. The three other classification 



angle pairs, 4>h vs. <Pa, Oh vs. 4>a, and Oh vs. Oa, have 
rank correlation coefficients 0.139, 0.292, and 0.321, re- 
spectively, and probabilities of occurring by chance of 
much less than 1% in each case. The mild but significant 
correlations among the host galaxy and quasar classifica- 
tion angles may link recent star formation to the physical 
properties of the AGNs. 

6. DISCUSSION & SUMMARY 

The results presented here suggest that eigenspectrum 
decomposition has the potential to be a powerful tech- 
nique for investigating the properties of broad- line AGNs 
and their host galaxies. We have shown that the tech- 
nique can efficiently and reliably separate the spectro- 
scopic components in composite spectra under certain 
conditi ons. The approac h is similar in some respects to 
that of lHao et alJ l)2005a|L who used eigenspectra of pure 
absorption-line galaxies to remove the stellar component 
of active galaxies. Here we have used sets of both galaxy 
and AGN eigenspectra to reconstruct each component, 
which allows the analysis of all the continuum, emission, 
and absorption features. A large number of broad-line 
AGN spectra in the SDSS dataset are suitable for this 
technique, but there is no reason it cannot be used for any 
set of optical/UV flux calibrated AGN spectra. There 
are limitations dependent on the S/N ratio, the frac- 
tional contribution of the host galaxy, and the redshift 
(§3). The redshift limit is imposed only by the wave- 
length coverage of the galaxy eigenspectra, so it should 
be possible to extend the technique to higher redshifts by 
constructing eigenspectra with shorter wavelength cover- 
age, or obtaining AGN spectra at near-IR wavelengths. 

The reconstructed spectra of over 4600 broad-line 
AGNs in the SDSS show that the host galaxies span a 
wide range of spectral classes, but the distributions of 
their classification angles show marked differences from 
their inactive counterparts. In particular, the values of 
the second classification angle indicate strong evidence 
for post-starburst activity in a large fraction of the hosts. 
Assuming that the host galaxies are predominantly ellip- 
ticals or otherwise bulge-dominated, the host population 
appears to be drawn mainly from the high luminosity 
end of the luminosity function. The colors of the hosts 
as a function of luminosity also become bluer than ex- 
pected for inactive elliptical or bulge-dominated galaxies, 
as the host luminosity increases. These results can be 
qualitatively explained by black- hole/bulge mass scaling 
relationships evident at low redshift, along with a link be- 
tween recent star formation and the formation of AGNs. 
For typical SDSS spectra, the sensitivity limits of the 
technique allow us to probe up to AGN Eddington ra- 
tios a few tenths. At the highest S/N levels, there are 
still virtually no AGNs that appear to emit beyond the 
Eddington limit. 

Our results show that AGNs can reside in host galaxies 
with a wide variety of spectral classes, and generally con- 
firm previous imaging studies that show that host galax- 
ies tend to have bluer colors than expected for inactive 
galaxies. The imaging and spectroscopic decomposition 
techniques are complementary, and there is much more 
information in the spectra than we have presented in this 
paper. Our focus here has mainly been to illustrate the 
eigenspectrum decomposition technique, and to highlight 
some initial results of its application to SDSS broad-line 
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Fig. 18. — The relationships among the AGN and host galaxy classification angles. Contours are spaced by density differences of 
approximately 20% of the peak value. Points are shown if they fall, or nearly fall, outside of the lowest density contour. There are mild 
but significant correlations among the classification angles, except for 8 a vs. <f>H- 



AGN data. There are a number of directions for future 
work, including estimates of black hole masses, the spec- 
tral properties of the galaxies and AGNs, the luminosity 
functions of both the AGNs and host galaxies, and vari- 
ations of the object properties among classes. 
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TABLE 1 

AGN and Host Galaxy Information Table Format 



Column 


Format 


Description 


1 


A18 


Object Designation hhmmss.ss+ddmmss.s (J2000; truncated coordinates) 


2 


F11.6 


H.A. (J2U0U, degrees] 


3 


F11.6 


Dec. (J2U0U, degrees) 


4 


F7.4 


Redshift 


5 


15 


C\ i ■ Til J M 1 

spectroscopic Plate Number 


6 


11 


Spectroscopic Fiber Number 


7 


16 


Modified Julian Date of spectroscopic Observation 


8 


A12 


Morphology (resolved or unresolved) 


9 


F6.2 


Average spectroscopic b/JN Per Pixel m the l Band 


10 


r 7.3 


g Point Spread Function (PSF) Magnitude 


11 


F6.3 


g PSF Magnitude Error 


12 


F7.3 


r Point Spread Function (PSF) Magnitude 


13 


F6.3 


r PS1 Magnitude terror 


14 


F7.3 


l Point spread 1 unction (Psl ) Magnitude 


15 


F6.3 


i PSF Magnitude Error 


16 


F7.3 


g Spectroscopic Magnitude 


17 


F7.3 


r Spectroscopic Magnitude 


18 


F7.3 


i Spectroscopic Magnitude 


19 


F7.3 


g Cmodcl Magnitude 


20 


F6.3 


g Cmodcl Magnitude Error 


21 


F7.3 


r Cmodcl Magnitude 


22 


F6.3 


r Cmodcl Magnitude Error 


23 


F7.3 


i Cmodel Magnitude 


24 


F6.3 


i Cmodcl Magnitude Error 


25 


F7.3 


i_> Cl 1 1 VJJOjl<XL, fi UoUi l_/ Lr 1 W 1 1 I lllditilll Lr L1L1\_ o / 


26 


F7.3 


r Band Galactic Absorption (magnitudes) 


27 


F7.3 


i Band Galactic Absorption (magnitudes) 


28 


F7.2 


g Band AGN Component Absolute Magnitude 


29 


F7.2 


r Band AGN Component Absolute Magnitude 


30 


F7.2 


g Band Host Galaxy Absolute Magnitude 


31 


F7.2 


r Band Host Galaxy Absolute Magnitude 


32 


F7.2 


g Band Correction Applied to AGN Absolute Magnitude 


33 


F7.2 


r Band Correction Applied to AGN Absolute Magnitude 


34 


F7.2 


g Band Correction Applied to Host Absolute Magnitude 


35 


F7.2 


r Band Correction Applied to Host Absolute Magnitude 


36 


F5.2 


Flux Fraction from 4160-4210A from Host Galaxy 


37 


F6.1 


AGN First Classification Angle (degrees) 


38 


F6.1 


AGN Second Classification Angle (degrees) 


39 


F6.1 


Host Galaxy First Classification Angle (degrees) 


40 


F6.1 


Host Galaxy Second Classification Angle (degrees) 



TABLE 2 

AGN and Host Galaxy Information 3 - 



Object 


a J2000 


<5j2000 


Redshift 


Plate 


Fiber 


MJD 


Morphology 


S/Ni 




M r , A h 




M r , H b 




4>A° 




4>H b 


e„ b 


(SDSS J) 


(deg) 


(deg) 




















(deg) 


(deg) 


(deg) 


(deg) 


000011.41+145545.6 


0.047547 


14.929353 


0.4596 


0750 


499 


52235 


unresolved 


9.88 


-22.39 


-22.73 


-22.55 


-22.97 


0.17 


4.7 


93.9 


-7.4 


93.4 


000011.96+000225.3 


0.049842 


0.040372 


0.4790 


0387 


200 


51791 


unresolved 


18.35 


0.00 


0.00 


0.00 


0.00 


0.00 


0.0 


0.0 


0.0 


0.0 


000043.95-091134.9 


0.183138 


-9.193035 


0.4388 


0650 


538 


52143 


unresolved 


17.77 


-23.38 


-23.73 


-22.77 


-23.25 


0.08 


2.2 


92.3 


-8.2 


85.9 


000048.15-095404.1 


0.200661 


-9.901158 


0.2057 


0650 


494 


52143 


resolved 


13.04 


-19.65 


-20.13 


-21.51 


-22.11 


0.68 


14.1 


85.1 


-10.2 


96.3 


000102.18-102326.9 


0.259117 


-10.390822 


0.2943 


0650 


166 


52143 


unresolved 


18.56 


-22.20 


-22.77 


-22.33 


-21.57 


0.19 


-4.6 


82.7 


-32.7 


95.1 



a Tableful is presented in its entirety in the electronic edition of the Astronomical Journal. A portion is shown here regarding its form and content. The full catalog contains 40 columns of 
information for 11648 broad-linc AGNs. 
^Default values, set to zeros, are given for entries with a non-physical spectroscopic decomposition. 



